Parsimony, likelihood, and simplicity
نویسنده
چکیده
The latest charge against parsimony in phylogenetic inference is that it involves estimating too many parameters. The charge is derived from the fact that, when each character is allowed a branch length vector of its own (instead of the homogeneous branch lengths assumed in current likelihood models), the results for likelihood and parsimony are identical. Parsimony, however, can also be derived from simpler models, involving fewer parameters. Therefore, parsimony provides (as many authors had argued before) the simplest explanation of the data, or the most realistic, depending on one s views. If (as argued by likelihoodists) phylogenetic inference is to use the simplest model that provides sufficient explanation of the data, the starting point of phylogenetic analyses should be parsimony, not maximum likelihood. If the addition of new parameters (which increase the likelihood) to a parsimony estimation is seen as desirable, this may lead to a preference for results based on current likelihood models. If the addition of parameters is continued, however, the results will eventually come back to the same place where they had started, since allowing each character a branch length of its own also produces parsimony. Parsimony can be justified by very different types of models— either very complex or very simple. This suggests that parsimony does have a unique place among methods of phylogenetic estimation. 2003 The Willi Hennig Society. Published by Elsevier Science (USA). All rights reserved. The two most widely used criteria for phylogenetic inference are parsimony and maximum likelihood. Usually, parsimony is defended by recourse to realism, generality, and economy of assumptions, while maximum likelihood is defended by its explicit use of evolutionary models and the idea that phylogenetic inference must be viewed exclusively as a problem in statistical inference. Parsimony, under some specific models, is also a maximum likelihood estimator; as noted by Farris (1986, p. 24), the ‘‘method of maximumlikelihood is not a technique for estimating anything in particular, but a way of deriving estimation procedures from models.’’ The most widely used maximum likelihood methods are now the ones based on the work of Felsenstein (1973, 1981), reviewed in Swofford et al. (1996), which assume stochastic, Markovian models of evolution, where all the sites have the same probability of change along a branch (a limited amount of rate variation is allowed in some models; see Yang, 1993, 1994). This probability depends on the ‘‘length’’ of the branch (time and mutation rates combined). These models are derived from neutral theories of evolution, which assume that only time and mutation rate are the forces behind most of molecular evolution. Throughout this paper, the term ‘‘likelihood’’ (or ‘‘likelihoodist’’) is used to denote this type of method (or the people espousing its use). Parsimony was not originally justified by means of an explicit probabilistic model. In the belief that only methods based on explicit probabilistic models are defensible, the likelihoodists have tried to discover (starting with Felsenstein, 1978) ‘‘the model’’ implicit in parsimony; the resulting findings have been used to criticize the assumptions supposedly required to justify parsimony. Some authors have defended parsimony from a philosophical perspective (Kluge, 1997, 2001; Siddall, 1997, 2002), but most likelihoodists (with few exceptions, such as de Queiroz and Poe, 2001) have ignored these philosophical issues. The purpose of the present paper is to revise some aspects of the parsimony vs likelihood controversy, from a more statistical perspective. Recent criticisms of parsimony accuse it of relying Cladistics 19 (2003) 91–103 www.elsevier.com/locate/yclad Cladistics E-mail address: [email protected]. 0748-3007/03/$ see front matter 2003 The Willi Hennig Society. Published by Elsevier Science (USA). All rights reserved. doi:10.1016/S0748-3007(03)00017-3 on an implicit model that is too complex and therefore over-fits the data (this charge has now replaced earlier criticisms that accused it of being too simplistic—e.g., Felsenstein, 1982, p. 388; Felsenstein, 1988, p. 535). My main conclusion is that, since most parsimonious trees are maximum likelihood estimates under different models (either very simple or very complex), parsimony can be seen as providing either the simplest explanation of the data or the most realistic. Current likelihood methods lie in between. More importantly, the fact that parsimony can be derived under very different types of models also casts doubt on the notion that one can evaluate a method justified on logical grounds by simply evaluating statistical models that happen to produce similar results.
منابع مشابه
Key Concepts in Model Selection: Performance and Generalizability.
What is model selection? What are the goals of model selection? What are the methods of model selection and how do they work? Which methods perform better than others and in what circumstances? These questions rest on a number of key concepts in a relatively underdeveloped field. The aim of this paper is to explain some background concepts, to highlight some of the results in this special issue...
متن کاملSimplicity in science
This dissertation investigates the possibility of justifying simplicity principles in science. The labor of these projects is organized into three chapters. The first chapter introduces some of the key authors and issues in the history of simplicity in science. This chapter also gives a detailed discussion of the work of the 19th century physicists Le Verrier and Newcomb who played a crucial ro...
متن کاملFailed refutations: further comments on parsimony and likelihood methods and their relationship to Popper's degree of corroboration.
Kluge's (2001, Syst. Biol. 50:322-330) continued arguments that phylogenetic methods based on the statistical principle of likelihood are incompatible with the philosophy of science described by Karl Popper are based on false premises related to Kluge's misrepresentations of Popper's philosophy. Contrary to Kluge's conjectures, likelihood methods are not inherently verificationist; they do not ...
متن کاملDynamic homology and the likelihood criterion
The use of likelihood as an optimality criterion is explored in the context of dynamic homology. Simple models and procedures are described to allow the analysis of large variable length sequence data sets, alone and in combination with qualitative information (such as morphology). Several approaches are discussed that have different likelihood interpretations in terms of maximum parsimony like...
متن کاملPhilosophy and phylogenetic inference: a comparison of likelihood and parsimony methods in the context of Karl Popper's writings on corroboration.
Advocates of cladistic parsimony methods have invoked the philosophy of Karl Popper in an attempt to argue for the superiority of those methods over phylogenetic methods based on Ronald Fisher's statistical principle of likelihood. We argue that the concept of likelihood in general, and its application to problems of phylogenetic inference in particular, are highly compatible with Popper's phil...
متن کامل